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ABSTRACT 

In the future, space structures much larger than those of today are 
expected. The analyses of such structures will have computational require- 
ments considerably greater than what is currently being realized. To gain 
some understanding of those requirements, an investigation into the process- 
ing characteristics of NASA/JSC's primary structural analysis computer 
program, NASTRAN, has been conducted. Based on the outcome of that 
investigation which resulted in a model sensitive to various NASTRAN host 
systems and workload scenarios, a set of recommendations based on cost/ 
performance considerations has been proposed. 
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• . SECTION 1 

INTRODUCTION 

1.0 BACKGROUND 

NASTRAN (for NASA STRUCTURAL ANALYSIS) is a general pur- 
pose system of computer software used for solving a wide variety of engineer- 
ing analysis problems by means of the finite element displacement method. At 
Johnson Space Center (JSC) NASTRAN is used primarily for performing static 
and dynamic structural analyses of the Space Shuttle.* The host system for 
JSC's processing is the UNIVAC 1110 (U1110). Because of NASTRAN's 
current impact on the U1110, and because even larger space structures than 
the Shuttle loom as future prospects, MITRE has been directed by the 
Institutional Data Systems Division (I DSD) at JSC to address these future 
structures in terms of their effect on future computing facilities. 

1.1 Objectives 

The objectives of this task are to: 

' (1) Gain a sufficient understanding of the processing behavior of 

NASTRAN in order that cost versus performance tradeoffs can 
be conducted for various computing systems. 

(2) Study candidate systems in order to determine the most cost 
effective way for supporting NASTRAN at JSC in the future. 

(3) Study the addressing constraints of UNIVAC systems and the 
effect of these constraints on the NASTRAN user and on job 
turnaround time. 


* 


For a more complete description of NASTRAN's capabilities, see [lj. 
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1.2 Organization of Report 


In Section 2 the processing implications of structural analysis are 
addressed in order to lay a foundation for further developments. Section 3 
addresses Hie current NASTRAN situation on the U1110. Section 4 describes 
the current J SC NASTRAN workload and estimates the effect of workload 
growth due to increased structure size. A cost/performance analysis is 
conducted in Section 5 t and Section 6 summarizes the findings of this study. 
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SECTION 2 

STRUCTURAL ANALYSIS PROCESSING 

2.0 INTRODUCTION 

The mathematical formulation of a structural analysis typically 
involves the use of large matrices of high precision data. In the following 
paragraphs, some basic comments about matrices are presented in order to 
lay the foundation for later developments. To illustrate the translation of a 
structural analysis problem into its mathematical state, the solution of a 
simple statics problem is discussed in Appendix I . 

2.1 Matrix Mathematics 

A matrix can bethought of as a rectangular arrangement of numbers; 
for example. 



’l 3 5* 


10 2 3 

A = 

0 6 2 
1 0 3 

, B = 

8 3 3 1 

0 0 5 2 


The size of a matrix is its order. The order of A is 3 rows by 
3 columns (or 3 by 3); the order of B is 3 by 4. A set of rules for matrix 
arithmetic exists and includes such common operations as addition, subtrac- 
tion, multiplication, and "division" (multiplicative inverse). For a discus- 
sion of matrix algebra/arithmetic the reader is referred to (2]. 

In terms of system storage, a matrix of order n by m has n x m data 
elements. Because of the large number of arithmetic operations performed in 
structural analysis (discussed in Section 2.2), accuracy problems due to 
truncation and error propagation require that floating point data values con- 
sist of about 54 bits [3]. This requirement generally implies double precision 
on most systems. 
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In terms of classical mathematics, the addition of two matrices, each 

of order n by m, requires a number of arithmetic operations proportional to 

n x m; the multiplication of two matrices of orders n by p and p by r requires 

a number of operations proportional to n x p x r. To invert (perform the 

multiplicative inverse of) a matrix of order n by n, the number of operations 
3 

is proportional to n . When one interprets the storage and computational 
requirements associated with large matrices whose orders may easily be in 
the tens of thousands for static analyses, it becomes readily apparent that 
the problems can be of enormous size. Hundreds of millions of computer 
words and billions of operations are suggested by the above. However, in 
the world of NASTRAN (and other structural analysis programs), the charac- 
teristics of these matrices are usually expoued to make the situation much 
more tenable than it might appear. Specifically, these characteristics have 
to do with the very large proportion of zero valued elements in all structural 
analysis matrices. In fact, it is not uncommon for the density (of nonzero 
elements) of the matrices to be less than 1%. In such a situation, the matrix 
is said to be sparse . NASTRAN employs the use of sparse matrix data 
structures, thereby eliminating the need to physically represent zeros; and 
in conjunction with these data structures, utilizes sparse matrix computational 
techniques. Even though an "elimination" of 99% is suggested, the typical 
structural analysis requires several matrix operations and the amount of 
storage and the number of computer operations can still be quite formidable. 

2.2 Execution Characteristics of NASTRAN 

In general, NASTRAN has a multitude of capabilities and at JSC it 
is used in several ways. However, since JSC’s workload is dominated (80%) 
by static analyses which, for large structures, are dominated (75%) by the 
symmetric decomposition (SDCOMP) of the stiffness matrix, the execution 
characteristics to be discussed will be those of SDCOMP. 
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The stiffness matrix is a banded, symmetrical matrix. The quality 
of symmetry means the matrix remains unchanged when the rows and columns 
are interchanged. A banded matrix is one in which the nonzero terms are 
clustered about the diagonal. Figure 2-1 illustrates the stiffness matrix. 


$x 


XX 

XXX 

XXX 

x ¥ 


XX f J 

XXX v 

XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 
XXX 

l XXX 

> XXX 

XXX 
XXX , 
XX3d 

xxf 


Figure 2-1 Stiffness Matrix 


The decomposition o£ the stiffness matrix is directed to the solution 
of a system of linear equations. The symmetric decomposition algorithm used 
by NASTRAN works very efficiently with a memory scratch pad area whose 
size is dependent on the semi-bandwidth (or, in NASTRAN terminology, the 
number of active columns) of the matrix. This scratch pad can be of varying 
sizes; when the scratch pad is less than a certain size NASTRAN has to 
employ spill logic which, in effect, is a tradeoff between I/O activity and memory 
space . 

2.2.1 SDCOMP CPU Requirements 

The CPU requirements for symmetric decomposition can be approxi- 
mated by the relationship, T=0.5 x m x N x C [4], where m is a system- 
dependent , experimentally determined constant which represents tne amount 
of CPU time required to make one pass through SDCOMP's tight multiply-add 
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loop which dominates computation; N by N is die order of the Matrix*, and C 
is the average number of active columns per row. Taking a to be 6 micro- 
seconds for the UNIVAC 1! 10 (see Appt idix 11), and plotting T with C = N/25 
(based on current workload characteristics), results in the curve shown in 
Figure 2-2. 



From Figure 2-2, it follows that if the structure size doubles, 
the CPU time required by SDCOMP increases by 700%; if the structure size 
triples, the CPU time required by SDCOMP increases by 2600%; and so on. 

2.2.2 SDCOMP I/O Requirements 

The decomposition of a symmetrical matrix can proceed very 
efficiently when a memory scratch pad area of approximately C^/2 data values 


In structural analysis this is referred to as N degrees of freedom (dof). 
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is available [41. where C is the maximum number of active columns in any 
row. Assuming that NASTRAN's instruction space plus I/O buffers is 
45000 words (see Appendix II), the no-spill , double precision memory 
requirements for the UNIVAC 1110 would be as shown in Figure 2-3 for 
the case where C = N/25. 



Figure 2-3 SDCOMP: No-Spill Memory Space vs. Structure Size 


Before W exceeds the maximum address limit of the program, spill must be 
employed. For the above, spill becomes a necessity between 11000 dof and 
12000 dof because of the UlllO's limit of 262,000 words per program. 

In the event that sufficient memory cannot be obtained to avoid spill, 
SDCOMP can proceed, in most cases, with whatever amount of memory is 
available. Again, the tradeoff is between I/O activity and memory space. 
The spill curve has the characteristics shown in Figure 2-4. 
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a 


b 


c Memory 


Figure 2-4 SDCOMP Spill Behavior: I/O vs Memory 

The point 'a* represents the minimum amount of memory with which SDCOMP 
can proceed. The point 'b' is meant to suggest that there exists a system 
optimal amount of memory. And beyond point 'c' where spiH is eliminated, 
excess memory would be dedicated to I/O buffers and might be less beneficial 
than the reduced I/O charges would justify. 

2.3 Reducing the Impact of NASTRAN on the Computer 

Based on the preceding discussion, it follows that the bandwidth of 
the stiffness matrix is the main determinant of system impact. The following 
paragraphs look at ways to reduce this impact. 

2.3*1 Bandwidth Reduction 

For the purposes of its mathematic *1 description, the nodes of a 
structure can be numbered arbitrarily. For a given structure, all numbering 
schemes lead to the same s’-e stiffness matrix and the same number of nonzero 
terms; however, different numbering schemes lead to different arrangements 
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of nonzero terms. That arrangement which produces the minimum average 
bandwidth will result in the most favorable computing time. Two popular 
computer programs available to NASTRAN users for this purpose are 
BANDIT [5] and WAVEFRONT [6]. 

2.3.2 Substructuring 

Applicable to both static and dynamic analyses, substructuring is 
the division of a large single structure into component structures. Each 
component (or substructure) is analyzed separately and finally all are recon- 
nected for analysis of the total structure. In addition to providing an 
approach to analysis of a structure too large for efficient solution on the 
computer, substructuring offers the advantage of working with smaller and 
more manageable parts , each being independent of the others . [7] 

The general procedures used in sub structuring analysis can be 
divided into three distinct phases: 

Phase I - NASTRAN analysis erf each unique structure to produce, 
in matrix form, a description of each substructure in terms of a reduced set 
of degrees erf freedom that , at a minimum , include the boundary degrees of 
freedom that connect to adjacent structures. 

Phase II - Combination of matrices from Phase 1 with any addi- 
tional structure that the user may wish to define , and the subsequent analysis 
of the "pseudo structure." 

Phase III - The results from Phase I and Phase II are utilized to 
obtain detailed data from the individual substructures. One computer run is 
required for each substructure considered in Phase III. 

The substructuring procedure described above involves the execu- 
tion of 2k+l computer runs, where k is the number of substructures. 
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2.4 Host Computing Systems for NASTRAN 

In so far as NASTRAN host systems are concerned, there are 
several systems ranging from mini-computer to super computer which could 
be an adequate system for NASTRAN. There is currently a great deal of 
enthusiasm for the mini -computer [8], especially among engineering groups, 
and there is little doubt that super computers such as the CDC CYBER 203B 
Csuperceder of the STAR) and CRAY I , by virtue of their vector processing 
capabilities, would be powerful NASTRAN hosts. However, to qualify for 
consideration in this study, it was not whether the system had potential, but 
whether the system had an operational and supported version of NASTRAN. 

It was not considered reasonable for JSC to underwrite or perform the con- 
version of NASTRAN to another system. Such an activity would be very 
costly from the standpoint of both initial conversion and continuing maintenance/ 
enhancement. In the past, NASA (Langley Research Center) has sponsored 
several studies [9,10,11,12] to determine the feasibility of converting 
NASTRAN to various super computers. None of the studies has resulted in 
NASA's funding such a conversion. One of the problems in justifying such a 
conversion and continuing maintenance is the cost of the effort versus the 
relatively small number of prospective host systems. (To give better perspec- 
tive to this point, NASTRAN consists of between 400,000 and 500,000 
FORTRAN statements.) 

Table 2-1 shows the host computing systems for NASTRAN consi- 
dered in this study. 


Table 2-1 Host Systems for NASTRAN 


VENDOR 

COMPUTER SERIES 

CDC 

DEC 

IBM 

UN1VAC 

CYBER 17X: 172 m 176 

VAX - 11/780 

370/303X: 3031, 3032, 3033 

1110; 1100/8X: 1100/80-*! 100/84 
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At the present time the DEC VAX-11/780 version is under development and 
is not available for release. The VAX is represented in this study since it 
is expected that a version of VAX NASTRAN will be available from the 
MacNeal-Schvendler Corporation (MSC) during Spring 1979*. It should be 
noted that in the case of the IBM version, those systems which can execute 
IBM-compatible software would extend the vendor range of Table 2-1. 

The following paragraphs provide a brief description of the systems in 
Table 2-1. For more detailed descriptions see £133 and/or product informa- 
tion available from the vendors . 

2.4.1 CPC CYBER 17X 

The CYBER 170-series which spans the medium-to-large class of 
computers is the latest evolution of an architecture that began with the 
CDC 6600. The 171 and 172 can be either unit processors or dual processors; 
the 174 is a dual processor 173, while the 175 and 176 are unit processors. 

For all of these systems the architectural philosophy is to perform computa- 
tion with the central processor(s) (CP) and to distribute the slower, lower- 
level functions such as input/output and system control among multiple peri- 
pheral processors. The central memory (CM) of these systems is made up of 
60-bit words and can consist of as many as 262,144 words of which no more 
thap 131,072 can be used by a single FORTRAN program. Optionally avail- 
able is an extended core storage (ECS) which can consist of up to 2,097,152 
words. On the 176 a minimum ECS complement of 524,288 words is required. 
ECS is different from CM in that it cannot be executed from directly; in 
general, the system uses ECS as an ultra-high-speed I/O device. The 176 
can use ECS in a more flexible way than can the other systems, but still ECS 
on the 176 is nowhere near a full-fledged extension of CM. From 171 through 
174, the CPs consist of a single unified arithmetic unit that executes all 
instructions. The 175 and 176 CPs employ nine independent functional units 

* 

Concurrently with and independent of MSC, another version of VAX NASTRAN 
is being developed by the Goddard Space Flight Center (GSFC). This version 
is scheduled for completion during FY80. 



that perform specialized operations thereby allowing a high degree of con- 
currency . Also, on the CPs of the 175 and 176 there is an instruction stack 
similar to a cache memory which can hold between 14 and 48 instructions. 
For loops which can fit entirely within the stack, very good performance can 
result. 


2.4.2 DEC VAX-11/780 


The DEC VAX-11/780, which is an extension of the PDP-11 family 
of computers, is generally regarded as a "supermini". The VAX/VMS 
virtual memory operating system provides a virtual address space of over 
4 billion bytes (byte = 8 bits). The real memory complement of the VAX can 
presently be as large as 8 million bytes. The processor includes an 8K byte 
cache memory. 

2.4.3 IBM 370/303X 

The IBM 370/303X family of computers includes the 3031, 3032, and 
3033- These virtual memory systems can accommodate programs whose 
individual virtual sizes can be as large as 16 million bytes (byte = 8 bits) 
when operating under MVS (Multiple Virtual Storages). The real memory 
complement of the 3031 and 3032 can be as large as 6 million bytes, whereas 
the real memory size of the 3033 can go up to 16 million bytes. For the 3031 
and 3032, a buffer storage of 32,768 bytes is available. On a unit processor 
3033 this buffer storage can be as large as 65,536 bytes. 

2.4.4 UNIVAC 1110 and 1100/8X 

Although the 1110 is a discontinued system as far as the current 
UNIVAC product line is concerned, it was included in this study because it 
will remain as part of the IDSD Central Computing Facility (CCF) for the 
foreseeable future. The CCF U1110 is a 2x2 configuration (i.e. , 2 CPUs 
(CAUs) and 2 I/O processors) with 524,288 words of memory (262,144 primary 
and 262,144 extended.) Due to an 18— bit address limitation, a single program 
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can, at a given instant, access no more than 262,144 words (262K words). 

While access to more than 262 K words is possible through a system feature 
called program b anking, the maximum instantaneous address space is still 
262 K words. 

The UN I VAC 1100/8X-series (including /80 through /84) represents 
UNIVAC's latest offering in the large system category. Except in the case 
of /80 which is a unit processor, the x (of 8x) signifies the number of CPUs 
on the system . An optionally available Scientific Accelerator Module (SAM) 
significantly increases the execution speeds of floating point arithmetic opera- 
tions. One of the most distinguishable differences between the /8X-series 
and its predecessors (1106, 1108 and 1110) is the use of a high-speed buffer 
storage in the Storage Interface Unit (SIU) connecting a large backing store 
of moderate speed. The maximum buffer storage size is 32,768 words while 
the backing store can be as large as 16 million words. However, as with the 
U1110, the single program can, at a given instant, access no more than 
262,144 words. 

2.5 Versions of NASTRAN 

NASTRAN's original development was sponsored by NASA's Goddard 
Space Flight Center (GSFC) during the late 1960's. Once completed, 
NASTRAN's maintenance and enhancement became the responsibility of the 
NASTRAN System Management Office at the Langley Research Center. During 
early 1979, Langley will cease this function to be assumed by COSMIC (NASA's 
Computer Software Management and information Center located at the Univer- 
sity of Georgia) which for several years, has had the role of distributing 
NASTRAN. Other versions of NASTRAN have been spawned from the earlier 
NASA version. Perhaps the best known of these versions is that marketed by 
the MacNeal-Schwendler Corporation (MSC) which was part of the original 
development team. Universal Analytics Incorporated (UAI) market feature 
and performance enhancement packages for the COSMIC version. JSC is 
currently using the COSMIC version with local performance modifications 
resulting from an earlier MITRE study [14] • 
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It has not been a goal of this study to compare the different versions 
of NASTRAN. However, a task within I DSD is currently concerned with 
selecting the most cost effective version of NASTRAN for the U1110. 

2.6 NASTRAN User Survey 

To de ne what the NASTRAN world outside JSC was like, a 
limited survey of other NASTRAN user groups was conducted. The intent 
was to find out what computing combination (computer system and version of 
NASTRAN) was used by other engineering groups which are generally involved 
with structures similar in size to those studied at JSC. Table 2-1 1 presents 
the findings of that survey. It is of some interest to note the absence of 
UNI VAC systems and the predominance of MSC NASTRAN among the private 
industry component of those surveyed. 
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Table 2—11 NASTRAN 

User Survey _ 

USER 


COMPUTER SYSTE 


COSMIC * 

MSC 

CDC 

IBM 

UN I VAC 


Boeing Company 


n 

CYBER 175 

370/168 



Ford Motor Company 



CYBER 175 



>- 

04 

General Motors 


H 


370/168 


H 

c/> 

General Dynamics 


IS 

CYBER 172 



o 

z 

Grumman Aerospace Corporation 

X 


CYBER 72 

370/168 


l-H 

w 

Lockheed - California 

X 



3033 


H 

< 

Martin Marietta 


X 

CYBER 172 

370/xxx 


> 
1— t 

04 

(X 

McDonnell Douglas 


X 

CYBER 175 

370/168 


Northrop 


X 


3033 



Rockwell 

X 


CYBER 176 

3033 



Ames Research Center 


X 

7600 




Dryden Flight Research Center 

X 


6600 




Goddard Space Flight Center 

X 

X 


360/95 



Jet Propulsion Laboratory 

X 




1108 

< 

to 

Johnson Space Center 

X 




1110 

< 

z 

Langley Research Center 

X 


CYBER 175 




Lewis Research Center 

X 




1110 


Marshall Space Flight Center 

X 




■ -LLQ2 


* Includes derivatives of COSMIC version. 






SECTION 3 

ADDRESSING LIMITATIONS ON THE U1U0 

3.0 BACKGROUND 

At the outset of this task the poor turnaround time typically realized 
by large NASTRAN jobs at JSC was blamed to a large extent on the addres- 
sing limitations of the U1110. The purpose of this section is to better explain 
those problems. As background to this explanation, it is useful to realize 
that many of the programs executed on the U1110 are quite large in size, and 
that the aggregate workload is considered heavy. 

3.1 NASTRAN's Problems on the U1110 

Since the memory requirements of current large NASTRAN jobs are 
approximately half of the 262,144 word limit allowing for moderate spill, it 
follows that addressing limitations are not responsible for any current prob- 
lems. These problems appear to be more related to the competition for mem- 
ory which is a function of (a) the aggregate memory requirements of the active 
job set, Cb) the amount of memory available to the active job set, and possibly 
(c) the scheduling philosophy of the operating system, EXEC 8. This compe- 
tition results in a large amount (often days) of system wall clock time for the 

job. Since the combined CPU and I/O requirements of a large NASTRAN job 

* 

are currently less than three (3) SUP hours, the greatest part of that job's 
life is spent waiting. If a job is at some stage of processing short of comple- 
tion, and if the system should go down for any reason (e.g. , system crash, 
preventative maintenance, or block time), whatever work had been done in 
behalf of the job is lost and will have to be repeated through a restart of the 
job. NASTRAN's users say that restarts are a common occurrence. To test 
the premise that a higher priority would "push" the job through the system 
more quickly, a fairly large job (100K words of memory, 80 SUP minutes) 

*Standard Unit of Processing. Used for resource accounting. 

17 


r r « 1 l f _i torao»u* 



was run as deadline batch with a goal completion time of the +hen current 
time. The experiment took place on a normally loaded system and completed 
in about three hours of wall clock time. While no assessment can be made of 
the adverse effect, if any, on the other jobs in the system, it does seem 
reasonable to suspect that a higher priority for NASTRAN would not only 
improve the turnaround of large NASTRAN jobs, but would also reduce the 
wasted computation due to restarts, and generally contribute to greater over- 
all system productivity. For further details on deadlining to improve system 
productivity, the reader is referred to [15] which discusses a UNIVAC 1108- 
related performance analysis. 


Deadline batch is a type of batch run whose priority is dynamically adjusted 
so as to realize a preset job completion time. The priority is a function of 
the run card-specified maximum SUPs, the current time, and the specified 
completion time. 
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SECTION 4 


JSC NASTRAN WORKLOAD DEFINITION 

4.0 INTRODUCTION 

lo accomplish the planning aspects of this study, it was necessary to 
identify future NASTRAN processing requirements. Since little was known 
abou. future space structures except i t they would be significantly larger 
than the Space Shuttle, arbitrarily defined workloads had to be used. The 
following paragraphs describe the workload modeling approach used in this 
study and the processing expectations for these workloads on the prospec- 
tive host systems. 

4.1 Present NASTRAN Workload 


Presently, the NASTRAN workload on the U1110 can be described 
as a set of small NASTRAN jobs and large NASTRAN jobs. Each of the small 
jobs consumes approximately 20 SUP minutes at a CPU:IO ratio of about 3:1 
and requires about 65,000 words of memory. Approximately 20 to 25 small 
jobs per week are submitted. The large job class consists of 2 to 3 jobs per 
week. Since static analyses account for 80% of these jobs, the large job 
class can be represented by static analysis jobs whose individual character- 
istics involve a stiffness matrix of 10,000 dof and 400 active columns.* 

4.2 Future NASTRAN Workload 

If the structure being studied increases in size by 3 ' "or of m, it 
is expected that the small jobs will not increase in size, hit rather, will 
increase in number by a factor of m. In the case of the large jobs, their 
quantity (2 to 3 per week) will remain fixed, but the size of the stiffness 


The dvnamic analyses accounting for the other 20% have similar execution 
characteristics, in terms c r memory requirements and SUP consumption, 
to those of the static analyses. 
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matrix and the number of active columns will be affected by the factor of m. 
This treatment of the active columns is felt to be conservative. 

Thus, for an eventual structure whose size is m times the current 
size, the workload will be as follows: 

Small lobs: 20 x m to 25 x m per week , each requiring 

• 65,000 ■ ords of memory 

• 15 U1110 CPU minutes (3/4 of 20) 

• 5 U1110 I/O minutes (1/4 of 20) 

Large Jobs 2 to 3 per week, each involving a stiffness matrix 
of m x 10,000 dof and m x 400 active columns. 

4.2.1 Small Jobs 

In order to evaluate the impact of small jobs on host systems other 
than the U1110, the following assumptions were made: 

(1) An amount of memory equivalent to 65,000 U1110 words would be 
required , 

(2) The I/O power on the other systems being considered would be 
the same as the I/O power of the U1110, and 

(3) The CPU requirements on the other systems would be related to 
the U1110 in terms cf the assumed power ratio ranges shown in 
Table 4-1 . 


Table 4-1 Relative CPU Power 


’■ r ""t 

SYSTEM 

CPU POWER RANGE 

CDC 

CYBER 173 

0.8 to 1.3 

CYBER 175 

2.4 to 5.3 

DEC 

VAX-1 1/780 

0.3 to 0.5 



0.S to 1.2 

IBM 

,1221 

2.1 tj 3.1 



3.6 io 5.4 



1 

UNIVAC 


1 to 1.2 



1 to 1.8 
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For a system whose CPU power range is a to b [abbreviated as (a,b)], 
the CPU requirements for processing the small job subset of the current 
NASTRAN workload are 

(15 U1110 CPU Minutes) x (1/b, 1/a) x (20, 25) 
or 

20 x 15/b to 25 x 15/a = 300/b to 375/a CPU Minutes 

Table 4-1 1 summarizes the CPU and I/O requirements of the small job subset 
of the NASTRAN workload for those systems considered in this study. 


table 4—1 1- Effect of Small Jobs on Candidate Systems 




CPU (Min) I/O (Min) 


o 

CYBER 173 

231 to 469 

ICO to 125 


Q 





(J 

CYBER 175 

57 to 156 

100 to 125 


U 





w 

Q 

VAX-1 1/780 

600 to 1250 

100 to 125 



3031 

250 to 469 

100 to i25 


2 

CQ 

3032 

97 to 179 

100 to 125 



3033 

56 to 104 

100 to 125 



1110 

300 to 375 

100 to 125 


u 





> 

1100/80 

250 to 375 

100 to 125 


z 





Z> 

1100/80 + SAM 

167 to 375 

100 to 125 
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4.2.2 Large jobs 


Because of the nonlinear effect (see Section 2.2) of large NASTRAN 
job processing with respect to a change in structure size, it was necessary 
to develop a mathematical model which could predict this nonlinear behavior. 
This model, which is more specifically a model of NASTRAhTs symmetric 
decomposition activity, is described in Appendix II. Table 4-1 II shows the 
model's predictions for single, non-substructured jobs varying in size from 
10000 dof to 60000 dof . The estimates, expressed in terms of memory, CPU 
and I/O requirements, are given for the considered host systems which are 
modeled as single processor systems essentially dedicated to the processing 
of the indicated NASTRAN jobs. The amount of memory shown in the table 
was selected to be the smaller of: a) the system limit for a single job and 
b) the smallest amount of memory greater than the spill point for the problem. 
For the DEC and IBM systems which utilize the concept of virtual memory, it 
was assumed that the problem was executed in a totally "real" region.* Each 
of these systems, except for the IBM 3033, was assumed to have available 
for a single program, the maximum amount of system memory less 2 Mbytes 
(1 Mbyte = million bytes) which would be available to the operating system and/or 
other programs. Although it can be configured as a 16 Mbyte system, the 
IBM 3033 was modeled as an 8 Mbyte system. 

Except for showing system-specific data; Table 4-1 1 1 is essentially 
a replay of Figures 2-2, 2-3, and 2-4. Noteworthy is the magnitude 

of processing required for the larger NASTRAN jobs and the spill phenomena. 
None of the systems are affected by spill at the 10K dof level. However, for 
the CDC and UNIVAC systems, spill comes into play between the 10K dof and 
20K dof levels. By 30K dof, none of the systems processed the problem with- 
out spill. As reflected by the CPU:I/0 ratios, the advantages of the more 
powerful systems are reduced as the problem size increases. 


* 

Because cf the nature of the SDCOMP algorithm [3J> it is felt that this man- 
ner of execution, realizable in practice, shows the virtual memory system 
in its best light. 
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Table 4-111. Large NASTRAN Job (Non-Substructured) 


=25 

s y 

iructurc Size 

10K 

20K 

30 K 

60K 



Memory 

* CPU 

I/O 

Memory 

CPU 

i/Q 

Memory 

CPU 

I/O 


CPU 

I/O 

u 

a 

CYBER 173 

116K 

2.4 

.2 

130K 

19.4 

4.9 

130K 

66.1 

24.0 

130K 

547 

366 

u 

CYBER 175 

116K 

.2 

.2 

130K 

1.5 

4.9 

130K 

5.4 

24.0 

130K 

49 

386 

DEC 

VAX-1 1/780 

893 K 

3.9 

.2 

2837 K 

30.9 

.9 

6000 K 

104.4 

3.6 

6000K 

839 

52 


3031 

893 K 

2.4 

.2 

2837K 

19.0 

.9 

4000 K 

64.6 

5.3 

4000 K 

521 

73 

03 

I—* 

3032 

893 K 

.9 

.2 

2837 K 

7.1 

.9 

40^0 K 

24.3 

5.3 

4000 K 

196 

73 


3033 

893 K 

.5 

.2 

2837 K 

4.3 

.9 

6000 K 

14.5 

3.4 

6000 K 

117 

49 


1110 

206K 

1.8 

.3 

260 K 

14.6 

5.1 

260 K 

50.0 

24.0 

260K 

416 

371 

< 

> 

1100/80 

206 K 

1.5 

.3 

260 K 

12.2 

5.1 

260K 

41.8 

24.0 

260K 

349 

371 

►— « 
z 

C3 

11 00/80 + SAM 

206K 

1.2 

.3 

260 K 

9.7 

5.1 

260K 

33.1 

24.0 

260 K 

274 

371 


* Memory space is expressed in words (CDC & UNI VAC) and bytes (DECS IBM), 
CPU & I/O are expressed in hours per week. 




Section 2.3.2 introduced the technique of substructuring as an 
alternative to die monolithic model. To model the effect of substructuring on 
the various large job sizes studied, the following assumptions* were made: 


a,- Substructuring would eliminate all spill I/O, 

b) All jobs would be substructured to 5000 dof jobs resulting 
in k (of 2k + 1 jobs, as discussed in paragraph 2.3.2) being 
equal to (dof of structure sizg-f 5000, 

c) The aggregate CPU requirements fen: the 2k + 1 jobs would be 
80% that of what the non-subst ructured model required, and 

d) The aggregate I/O requirements for the 2k + 1 jobs would be 
the total non-spill I/O of the non-substructured job plus two (2) 
minutes of I/O for each of the 2k + 1 submodels. 

Table 4-IV shows the expected system requirements based on die above 
assumptions. Even if assumptions (c) and (d) are disregarded because of their 
softness, the significant reduction in I/O of Table 4-IV over that erf Table 
4-1 II is due mostly to the elimination of spill I/O. Also, not to be overlooked 
is that system memory has ceased to be a critical performance factor. This 
judgement is based on the relatively small amount of memory space required 
by a 5000 dof job and the relatively high CPU: I/O ratios suggesting that it 
tail take only a few such jobs to saturate the CPU and thus effectively, the system. 


Based on the arbitrarily selected submodel size of (b), (a) seems to be a 
reasonable expectation; (c) and (d) were based on discussions with NASTRAN 
users and, in the absence of actual experimentation, should be regarded as 
soft. 
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Table 4-1 V. Large NASTRAN Job (Substructured) 










1 

nsn 

10K 


20K 

30 K 


60K 



CPU* 

1/0 

CPU 

1/0 

CPU 

1/0 

CPU 

1/0 

u 

Cl 

CYBER 173 

1.9 

.4 

15.2 

H 

51.3 

2.3 

M 

8 

u 

CYBER 175 

.14 

.4 

1.1 

■a 

3.9 

2.3 

WM 

8 

DEC 

VAX-11/780 

3.1 

• 4 

24.7 

1.2 

83.4 

2.5 

666 

9 


3031 

1.9 

.4 

15.2 

n 

51.3 

2.3 


8 

A 

3032 

.7 

• 4 

5.7 


19.3 

2.3 

' 

8 • 


3033 

• 4 

.4 

3.4 

■9 

11.6 

2.3 


8 

o 

< 

1110 


.4 

11.4 

B 

38.4 

B 

307 

10 

> 
»— * 

110080 

1.2 

• 4 

9.5 


32.1 


256 

10 

z 

1 1 0(V80 + SAM 


.4 

7.6 

1 

25.6 

1 

205 

10 


*CPU and I/O are expressed in hours per week. 













































4.2.3 Full NASTRAN Workload 


In the above paragraphs, the separate components of the NASTRAN 
workload were discussed for different workload scenarios on the various 
host systems. The purpose of the following paragraphs is to look at the sys- 
tem impact of the full NASTRAN workload for different structure sizes . For 
each of the considered systems, the amount of elapsed time for processing the 
full workload (20 m to 25 m small jobs and 2 to 3 10,000 x m dof large jobs) 
in a multiprogramming environment will be determined. A multiprogr amming 
analysis is generally difficult to accomplish in a simplistic fashion because of 
the resource scheduling which takes place by the operating system . Still 
such an approach is being taken since to do otherwise would be inconsistent 
with the level of detail available with die workload data. 

If the active job mix is taken to be such that no more than a single 
large job and a single small job are active at a given instant, then the number 
of system hours, SYST1ME, for the full NASTRAN workload can be approxi- 
mated as: 

SYSTIME = MAX(TLJCPU.TSJIOT) + MAX(TSJCPU,TL]IOT), 

where TLJCPU is the total amount of CPU time required by the large job 
population, TSJIOT is the total amount of I/O time required by the small jobs, 
TSJCPU is the total amount of CPU time required by the small jobs, and 
TLJIOT is the total amount of I/O time required by the large jobs. The 
assumptions are that the system is fully dedicated to NASTRAN processing 
until the NASTRAN workload is completed and that a sufficient backlog of 
NASTRAN jobs will be available so that a large job and small job will always 
be executing while representatives of either class remain to be processed. 

For non-substructuring, the assumption that only a single large job 
and a single small job will be active is reasonable when one considers that 
memory availability will essentially force this condition. In the case of 
sub structuring, where memory would not force this restriction, SYSTIME, 
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as estimated above, should be regarded as slightly conservative since a 
higher degree of multiprogramming could be accomplished with the smaller 
components of the large job class. In this case, SYSTIME would be closer 
to the sum of TLJCPU and TSJCPU resulting in only a relatively small dif- 
ference when compared to the SYSTIME computation used. 

Being conservative with respect to both the large and small job sub- 
populations and taking 3 large jobs and 25 x m small jobs, as well as the slower 
end of the power range in Table 4-1 • produces the results of Table 4-V. Of 
particular interest is the processing penalty incurred by not employing sub- 
structuring; further interpretation is deferred to Section 5 where the same 
data is looked at in an operational context. 
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SECTION 5 

COST/PERFORMANCE ANALYSIS 

5.0 INTRODUCTION 

From Table 4-V, it is apparent which are the most powerful systems 
for doing NASTRAN processing. It is now important to look at that power 
in terms of cost and to compute the more meaningful system selection criterion 
of cost/performance. Because the U1110 is not a current product offering, 
it will not be a part of the analysis in this section. 

5.1 Acquisition of Computer Configuration for NASTRAN Processing at JSC 

A computer obtained for local use can be acquired by either of two 
means- lease or purchase. The following analysis assumes that the system(s) 
will be purchased. 

Based on the number of hours expected of each system for the different 
workload scenarios, it is a simple matter to compute the system's cost effec- 
tiveness when the cost of the system is known. In order to arrive at the 
needed cost information the following configuration has been selected. This 
near-minimal configuration has resulted from the need to compare the wide 
range of computers considered, and is not necessarily the recommended 
configuration. Final configuration selection should take into consideration 
any other needs expected to be satisfied by the system. 

5.1.1 Hardware 

a) Single CPU 

b) Central Memory 


• 

CDC: 

262 K words 

• 

DEC: 

8 Mbytes 

• 

IBM: 

6 Mbytes (3031, 3032) 



8 Mbytes (3033) 

• 

UNIVAC: 524K words 
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c) Two (2) 9-track magnetic tape drives 

d) Five Hundred (500) Mbytes of online disk storage 

e) One (1) 600 LPM line printer 

f) One (1) card reader 

g) Four (4) CRT terminals 

5.1.2 Software 

a) Standard operating system generated for a low level of 
multiprogramming 

b) FORTRAN compiler and runtime library 

c) Basic system utilities 

d) NASTRAN 

The monthly costs of the various systems, configured as above and 
amortized over seven (7) years, are shown in Table 5-1 which corresponds to 
the non-substrt'ctured approach to problem solution. The costs include only 
hardware, software, and their maintenance. Operational costs such as 
personnel, cost of supplies, energy, floor space, etc. are assumed to be 
equal for all configurations and therefore are not a part of this comparative 
analysis. A system week of 120 hours (instead of 168 hours) is used to allow 
for operating system overhead, preventative maintenance, and quality of 
service. In some cases, as indicated by %SYSTEM, multiple systems can 
result. 


Some points to be made about Table 5-1 are: 

• The better cost/performances of the DEC and IBM systems 

over those of the CDC and UNIVAC systems are due mainly to pro- 
gram memory considerations. 

• At the point where %SYSTEM exceeds 300, that system is 
unable to handle a week's work since a large, nonsubstructured 
job cannot be split among several systems. 
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Table 5-1. Cost/Performance for Non-Substructured Workload 


SYSTEM 

CYBER 173 
CYBER 175 

VAX-1 1/780 

3031 

3033 

3033 

1100/80 
1100/80 + SAM 


— 1 


STRUC) 

fURE SIZE (DOF) 1 


MU1N 1 nL 1 LUo I 

10K 

20K 

3CK 

60K | 



%SYSTEM 

CPF 

%SYSTEM 

CPF 

%SYSTEM 

CPF 

%SY STEM 

CPF 


32539 

12.5 

41 

61.5 

200 

225.2 

733 

2331 

7585 


49892 

3.9 

19 

16.2 

81 

73.6 

367 

1086 

5420 


11916 

27.0 

32 

112.0 

133 

313.1 

373 

2230 

2657 


28019 

12.5 

35 

60.7 

170 

181.2 

508 

1485 

4162 


42258 

4.7 

20 

22.9 

97 

73-8 

312 

672 

2839 


61143 

3.2 

19 

13.6 

83 

44.6 

273 

413 

2528 


26756 

8.9 

24 

43.1 

115 

164.4 

440 

1800 

4817 


28410 

8.2 

23 

36.8 

105 

142.6 

405 

1611 

4577 


%SYSTEM = (No. of System Hours) / 120 

Cost Performance Factor: CPF = (% SY STEM) x (Monthly Cost) / 100 




• Excess capacities result when ^SYSTEM is rounded up to 
a whole number. 

Subject to the same costing considerations as above, Table 5-1 1 
corresponds to the substructured approach to problem solution. The monthly 
costs in that table have changed in some cases to reflect the need for less 
memory. Noteworthy points about Table 5- 1 1 are: 

• The clear supremacy of the CYBER 175 over all above 
competition . 

• A week's work at any workload level can now be handled 
provided enough systems are available. This is due to *he 
ability to now split the large job according to its substructured 
components and simultaneously compute on several systems. 

• Again, excess capacities will result when %SYSTEM is 
rounded up to a whole number. 

5.2 Service Bureau 


Although it has been assumed in the above cost analysis that a system 
would be acquired for local use at JSC, there also exists the service bureau 
option. With a service bureau, the user is more immune to the job turnaround 
problems which can affect the user of local facilities. And commensurate with 
this higher quality of service is a higher cost. Because so many variables 
come into play - accounting algorithms, priority of service, charges for 
permanent file space, terminal connect time, communications costs, remote 
job entry station(s), NASTRAN surcharge, quantity discounts, etc. - it is 
difficult at this level of analysis to judge the cost of using the service bureau 
in lieu of the local computer. Should a service bureau cost be estimated, it is 
important to recall, for purposes of comparison, that Tables 5-1 and 5—1 1 
do not include operational costs which are included in the service bureau 
charges. 
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Table 5-1 1 


SYSTEM 


MONTHLY COST 


CYBER 173 
CYBER 175 

,V AX-1 1/780 

3031 

3032 

13033 

i 

jl 100/80 

1100/80 + SAM 


32539 

49892 

8733 

25715 

40111 

56912 

26756 

28410 


Cost/Performance For Substructured Workload 


| STRUCTURE SIZE (DOF) 

| 10K 

20K 

30K J 

60K 

%SYSTEM 

CPF 

%SYSTEM 

CPF 

%SY STEM 

CPF 

%SY STEM 

CPF 

11.3 

37 

51.0 

166 

147.7 

481 

1064 

3461 

3.9 

19 

7.8 

39 

16.2 

81 

98 

489 

25.1 

22 

96.5 

84 

260.5 

227 

1770 

1545 

11.3 

29 

51.1 

132 

147.9 

380 

1064 

2737 

4.3 

17 

19.3 

77 

55.6 

223 

406 

1627 

3.2 

18 

11.5 

65 

34.7 

198 

252 

1433 

8.2 

22 

34.2 

91 

95.7 

256 

672 

1797 

7.6 

22 

29.4 

84 

79.7 

226 

543 

1544 


%SYSTEM = (No. of System Hours) / 120 

Cost Performance Factor: CPF = (%S/STEM) x (Monthly Cost) / 100 






















SECTION 6 
CONCLUSIONS 

This study has delved into the many facets of the NASTRAN situation 
at Johnson Space Center. The current situation, as well as those which might 
come about in the future, have been addressed. The most immediate conclu- 
sions to be drawn from this study are that: 

• Any growth in structure size will have increasingly nonlinear 
implications in terms of the computer processing requirements 
on any NASTRAN host system. 

• At some workload level, depending on the system, NASTRAN 
model substructuring has a practical value. For the COC 

and UNI VAC systems, this level occurs between 10K and 20K dof 
and for the DEC and IBM systems, before 30K dof. 

• The NASTRAN user can improve his own situation, as well as 
that of the system, by using bandwidth reduction techniques 

• such as BANDIT and WAVEFRONT. 

• While the UNIVAC 1110 does have program addressing limitations, 
these limitations are not responsible for ary current NASTRAN 
problems on the U1110 at JSC, nor should such become a prob- 
lem if substructuring is employed for problem sizes involving 
more than 11,000 dof (see Figure 2-3.) 

• Except for those cases where inordinate I/O (i.e. , heavy spill) 
prevails, the faster mainframes, as shown in Figure 6-1, offer 
the best cost/performance when measured in terms of hard- 
ware, software and maintenance, and assuming that substruc- 
turing will be employed. 
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Size CM) 


Figure 6-1 . Cost/Performance Characteristics of Candidate Systems 


The NASTRAN workload associated with the current structure sizes 
(IS) does not appear to pose a serious problem to the CCF since this workload 
is equivalent to less than 20 SUP hours per week. This opinion applies to 
the U1110 (assuming a solution to the present turnaround difficulties can be 
found) as well as to the replacement system expected during FY82. In the 
unlikely event that a version of NASTRAN does not exist for that replace- 
ment system, then the NASTRAN workload could remain on the U11J j which 
is expected to be retained. 


Should the future NASTRAN workload increase to 2S (2 times current 
structure size), NASTRAN's appetite for computing resources will approximately 
quadruple. While this resulting workload can be done on either the U1110 or 
the replacement system (assuming it hosts NASTRAN), the impact of NASTRAN 
will be felt by the system. In this workload range the mini-computer appears 
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to be a good alternative. The mini's attraction is due to the following 
reasons: 

a) If the mini is user operated, its relative CPF should be less 
than suggested by Figure 6-1 , 

b) An acquisition strategy phased to workload growth can be 
implemented, and 

c) The mini can probably be obtained more quickly than a maxi, 
and thus result in avoiding significant costs if a service bureau 
would otherwise have to be used. 

The attractiveness of the mini continues until seme point before the 
4S workload level. At that point the number of mini's (4 at 3|S, 6 at 4S) 
probably becomes impractical for the users to operate, and thus the minicomputer 
option might be avoided. 

At the 4S workload level, the choice of using the CCF is still viable 
assuming that this much heavier NASTRAN workload can be dealt with 
economically (CPF of replacement system). If such is not die case, or if 
the workload goes beyond the 4S level, then procurement of a large-scale 
dedicated NASTRAN system such as the CYBER 175 (or CYBER 176*) 
appears to be a good choice assuming that any excess capacity can be utilized. 

The workload ot the future cannot be presently estimat ed since it 
will depend on as yet undefined future space structures. The conclusions of 
this report are based on as accurate an understanding of the future workload 
as can be obtained at the present time. It would be a relatively simple matter, 
through use of the approach described and the model used in this study, to 
reassess the situation at a future time should a better definition of the work- 
load become available. 


Even though the CYBER 176 has not been addressed in this report, its ability 
to process NASTRAN jobs is approximately 30% better than that of the CYBER 
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APPENDIX I 

A SIMPLE STATICS PROBLEM 


As an example of the solution of a structural analysis problem, con- 
sider the structure in Figure 1-1 which is made up of a rigid block Con 
frictionless rollers) and four springs whose stiffnesses (in Ibs/inch) are 
a, b, c, and d, as shown. The block is constrained so that it can only more 
left or right. Applying forces , F.- and F^ at the points Q, 0and0, 
respectively, produces displacements of Uj, U 2 and as shown. 


^VA\\\\\\\\\V\\\\\\\\V 



Figure 1-1 Diagram of Structure 
The mathematical formulation of the problem proceeds as follows. 

Since the system is to remain in equilibrium, the following algebraic 
relationships must hold: 

F l - aU 1 - bUj - cU t + cU 2 = 0 
F_ + cU. - cU„ - dU, + dU„ = 0 
. 0 

which is equivalent to: 

F r = (a + b + c) Uj + (-c) U 2 + CO) U 3 
F 2 = C-c) Uj + (c + d) 0 2 + C-d) U 3 

F 3 = (0) U 1 + (_<1) U 2 + Cd) U 3 
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Expressed in Matrix notation, the above set of equations becomes 


F 1 


» 

a +■ b + c -c 

* 

0 


v 

F 2 

= 

- c c ♦ d 

- d 


U 2 

F 3 


0 -d 

d 


U 3 


or F = KU, where F is the Force Vector , K is the Stiffness Matrix , and U 
is the Displaceaent Vector . 
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APPENDIX II 

NASTRAN PERFORMANCE MODEL 


As explained in Section 2, the execution characteristics of NASTRAN 
increase nonlineariy with respect to an increase in structure size. Therefore, 
to study the effects of larger structures it became necessary to represent this 
behavior mathematically. As is often the case with mathematical models of 
complex systems, certain simplifying assumptions are necessary; for this 
model those assumptions are: 

(a) The symmetric decomposition activity represents 75* of the 
total job's requirements in terms of both CPU ami I/O. 

(b) The stiffness matrix, K, is tightly banded. 

(c) The semi-bandwidth (or number of active columns), C, erf K 
is related to the order, N, of K as C = .04 x N. 

(d) Where they were available, the MSC/NASTRAN timing and 
memory characteristics for SDCOMP were used: 

tj = Arithmetic time for multiply-add loop. 

t^ =Time to pack (unpack) one term in a string of nonzero 
matrix terms. 

t^ =Time to pack (unpack) one element in a column of the matrix 

t^ =Average time required to read (write) one block. 

n 1 s Number of words (or bytes) per floating point datum. 

n 2 ^Number of words (or bytes) per integer datum. 

WB^Amount of storage required for each I/O buffer. 

WN— Size of NASTRAN's instruction space including working 
storage . 

Table 1 1— I shows the values used. 
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Table 11-1. SDCOMP 


SYSTEM 

*1 

C^sec) 

*2 

msec) 

*3 

(A* sec) 

— 

BESI 

n l 

n 2 

m 

n 

O 

CYBER 173 

8 

6 

10 

mm 

1 

1 

g||||P| 

25000 

Q 

U 

CYBER 175 

0.6 

1.5 

3 

El ■ i 

1 

1 


25000 

O 

W 

o 

VAX - U/780 (1) 

13 

16 

17 

.044 

8 

4 

9400 

192000 


3031 (1) 

8 

14 

14 

.041 

8 

4 

9400 

192000 

T. 

03 

3032 C1) 

3 

6 

6 

.041 

8 

4 

9400 

192000 


3033 <2) 

1.8 

3 

3 

.041 

8 

4 

9400 

192000 

U 

< 

1110 

6 

5 

11 

.038 

2 

1 

1796 

34000 

> 

t-H 

1100/80 (3) 

5 

5 

10 

.038 

2 

1 

1796 

34000 

z 

1100/80 + SAM C3) 

4 

3 

6 

.038 

2 

1 

1796 

34000 


^Estimated. Compare with IBM 370/158's tj, t 2 , t y WB, and WN - 10, 12, 13, 9400 and 192000, 
respectively. 

^Estimated. Compare with AMDAHL 470/V6's t 1# t 2 , t y WB, and WN - 1.8, 3, 3, 9400 and 192000, 
respectively. 

^Estimated. Compare with UNIVAC 1110's 1 3 , t 2 , t,., WB, and WN - 6, 5, 11, 1796 and 34000, 
respectively. 

(/+) Based on WB and characteristics of devices configured for this study. 

^Five (5) buffers are used. 






























The mathematics of system behavior expressed in terms of CPU, I/O, 
and memory requirements are as follows: 

• The working space, WS, available for symmetric decomposition 
is 

WS = REGION -WN -5 x WB, 
where REGION is the amount of program memory space. 

• If WS < C x Gij + 2n 2 ), then sufficient memory does not exist for 
SDCOMP to execute. 

• If C x (C-i)< 2 x (WS - C x (nj + 2n 2 ))/nj , then SDCOMP can 
execute with no spill. 

• If spill is to occur, the number of rows. S, which can be con- 
tained in a spill group are 

S = c-Vc x (C-l) - 2 x (WS - C x (n 1 +2n 2 ))/n 1 

• Exclusive of spill, the amount of CPU time for SDCOMP is 

CPU | SDCOMpj = N x (0.5 x tj x C 2 + t 2 x C) 

• If spill occurs, increment CPU j SDCOMP J by N x t^ x C^/S 

• The Input/Output activity associated with SDCOMP consists of: 

(a) R J K { * Total time for reading K, the stiffness matrix. 

= t^ x T(N x (N x density) x + header)/WB] 

Where fxl is the smallest integer >x, "density” is the 
ratio of nonzero terms to total terms of K, and "header" 
is the number of words (bytes) representing I/O record 
control information (header = 8 x n 2 *) 
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(b) W | AC, PR J a Total time for writing the active column 

vector/pivotal row file 
= xfN x (C x (rij + n^) + header)/WB] 

(c) R JaC, Pr| s Total time for reading the active column 

vector/pivotal row file 

= w [ac, pr| 

(d) W JrESULTsJs Total time for writing SDCOMP results 

= t^ x Tn x ((C + 1) x nj + header)/WBl 
and should spill occur, 

(e) W J SPILL J = Total time for writing to the spill files 

= t 4 x ftr C/S - ll x rc/Sl x (S/2) 

+ ( fN/S] - fC/Sl ) x (C- D) x C 
x (n.^ + n 2 )/WBl 

R J S PILL | = Total time for reading from the spill files 

= W | SPILL | 

(0 The total I/O for SDCOMP is 

IO |sDCOMp|= R JkJ + W |AC, Pr| + R |aC, Pr| 

+ w (results] + w|spill|+ r [spill 




Finally, based on the 75% relationship of SDCOMP to total job. 


cpu [job] 
io [job] 


= (4/3) x CPU | SDCOMP 
= (4/3) x 10 [SDCOMP] 


44 



The above algebra plus some of the other calculations involved in 
this study were coded as a FORTRAN program to facilitate computation. 
Documentation of that program is provided on the following pages in the 
form of: 

(a) Program Symbol Dictionary 

(b) Program Listing 

(c) Sample Input 

(d) Sample Output 
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SYMBOL DICTIONARY 


•e'- 

er! 


BIGREG 

BYPASS 

C 

CF1 

CF2 

COST 

CPFH 

CPF L 

CPU 

cs 

DENSTV 

GSI 

HCPU 

HEADER 

HIOT 

HOSTS 

t 

JOB 

IOT 

JOB 

JOBCPU 
JOB IOT 
JOBWCT 
N 

NAME 

NLH 

Nil 

NSREGN 

nsys 

N1 

N2 

REGION 

HIN 

RKLL 

RSI 

S 

SJGPH 

SJCPl 
SOI OH 
SJIOL 
SPILL 
SOTARG 

ss 

ss$ 

SUFH 

S'JFL 

SYSTEM 

T 1 

T2 

T3 

T 4 

MB 

MIN 

MLLL 

MN 

UGSSS 


•BIG REGi r ** • * USED FOR ELIMINATION OF SPILL. 

PRINT SWITCH. T RUE * *> PRINT SYSTEM CHARACTERISTICS 
NUMBER OF ACTIVE COLUMNS. 

CONVERSION factor. RATIO OF JOB CPU TIME TO SOCOMP CPU TIME 
CONVERSION FACTOR. RATIO OF JOB I/O TIME TO SDCOMP I/O TIME 
COST OF SPECIFIED system 

cost performance factor for system, high end of range. 

COST PERFORMANCE FACTOR fop system, com end of range. 

SDCOMP CPU TIME (IN SECONDS.) 

MAXIMUM NO. OF ACTIVE COLUMNS WHICH CAN BE HELD {N MS WITHOUT SPILLING. 

DENSITY OF STIFFNESS MATRIX. 

G$I(X) . SMALLEST INTEGER . GE . X 
JOB CPU TIME EX TRESSED IN HOURS. 

NO. OF WOROS(BVTES) REPRESENTING 1/0 DATA BLOCK CONTROL INFORMATION. 

JOB I/O TIME EXPRESSED IN HOURS. 

NAMELIST PROVIDING runtime OPPORTUNITY FOR MOOIPYlNQ SYSTEM CHARACTERISTICS 
.‘OF 1 Y.'ALL CLOCK TIME IN «0"PR- FO'JALS HCPUtHIOT. 

NO. OF 1/0 BLOCKS TRANSFERRED DURING SDCOMP. 

SDCOMP I/O TIME (IN SECONDS). BASED ON 10B A T4. 

NAMELIST USEO FOR DESCRIBING mooel experiments. 

LARGE JOB CPU time (IN SECONDS). 

LARGE JOB I/O TIME (IN SECONDS I . 

LARGE JOB wall CLOCK TIME (IN SECONDS). EQUALS JOBCPU+JOBIOT . 

NO. OF DOF IN STIFFNESS MATRIX. 

alphanumeric identification of system. 

HIGH END OF RANGE FOR NUMBER OF LARGE JOBS. 

LOW END OF RANGE FOR NUMBER OF LARGE JOBS. 

APPROXIMATE MINIMUM MEMORY REGION RESULTING 1H NO SPILL 

NO. OF SYSTEMS REPRESENTED BY MODEL 

NO. OF WOROS(BYTES) PER FLOATING POINT DATUM. 

NO. OF WORDS) BYTES I PER INTEGER OATUM. 

USER IMPOSED MEMORY REGION SIZE. 

NO. OF BLOCKS ft FAD FROM FILE CONTAINING ACTIVE COLUMN/PI VOTAl ROWS. 

NO. OF BLOCKS READ FROM STIFFNESS MATRIX FILE OURING SDCOMP 
NO. OF BLOCKS READ FROM SPILL FILE DURING SDCOMP. 

WHEN SPILL OCCURS. S IS THE NUMBER OF ROWS WHICH CAn BE CAN BE CONTAINED IN A SPILL GROUP. 
HIGH tNU OF RANGE FOP SMALL JOb CPU TIME (IN KINjTIS). 

10W END OF RANGE FOR SMALL JOB CPU TIME (IN ,'MNUTES). 

HIGH END of range FOR SMALL JOB I/O TIME (IN MINUTES). 

LOW END OF RANGE FOR SMALL JOB I/O TIME (IN MINUTES). 

LOGICAL FLAG. TRUE *«> SPILL IN EFFECT FOR THIS PROBLEM AND SYSTEN. 

TEMPORARY VARIABLE. USED FOR TESTING SIGN OF CU»NT 1 TV BEFORE TAKING SOUARE HOOT. 

LOGICAL VARIABLE. TRUE ■ «> PROBLEM IS SUBSTRUCTUREO . 

COST OF SYSTEM ON WHICH SUBSTRUCTUREO WORKLOAD IS PROCESSED 
HIGH END OF range FOR SYSTEM UTILIZATION FRACTION. 

LOW END OF RAfiGE FOR SYSTEM UTILIZATION FRACTION. 

NUMERICAL INDEX CORRESPONDING TO COMPUTER SYSTEM MODELED. 

TIMING CHARACTERISTIC OF MOOELEO SYSTEM. SEE TABLE I X - 1 

TIMING CHARACTERISTIC OF MODELED SYSTEM. SEE TABLE II - 1 

TIMING CHARACTERISTIC OF MODELEO SYSTEM. SEE TABLE 1 1 - 1 

TIMING CHARACTERISTIC OF MODELEO SYSTEM. SEE TABLE I Z • I 

SYSTEM'S I/O BUFFER SIZE (IN WOROS( BYTE S I ) 

NO. OF BLOCHS WRITTEN TO ACTIVE COLUMN/PIVOTAL POM FILE 
NO. OF BLOCKS WRITTEN TO FINAL OUTPUT FILE, 

SIZE OF NASTRAN INSTRUCTION SPACE. (IN WORDS! BYTES ) ) 

COST OF SYSTEM CONFIGURED FOR NON-SUBSTRUCTURlNC 



MS 

wso 

XN1 

XN2 

XT1 

XT2 

XT3 

XT4 

XMS 

XWN 

2 


WORKING storage AVAILABLE FOR sdcomp SCRATCH pad 
NO. OF BLOCKS WRITTEN TO SPILL FILE. 

ARRAY FROM WHICH N1 I S OBT A INEO . N1 *XN1 ( SYSTEM) 
ARRAY FROM WHICH N2 IS OBTAINED. N2=XN2 (SYSTEM) 
ARRAY FROM WHICH T1 IS OBTAINED. T 1 * XT 1 ( SYSTEM ) 
ARRAY FROM WHICH T 2 IS OBTAINED. T2*XT2 I SYSTEM) 
ARRAY FROM WHICH T3 IS OBTAINED. T 3* XT3( SYSTEM) 
ARRAY FROM WHICH T4 IS OBTAINED. T4*XT4 ( SYSTEM) 
ARRAY FROM WHICH WB IS OBTAINED. WB«XWB ( SYSTEM ) 
ARRAY FROM WHICH WN IS OBTAINED. WN*XWN ( SYSTEM) 
FLOATING POINT EQUIVALENT OF S. 



NASTRAN COST/PERFORMANCE CALCULATIONS 
CUNIVAC FORTRAN V) 


t 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

4S- 26 

Co 27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 

50 

51 

52 

53 

54 

55 
58 


COMPILER (GEN-LI8RV) 

IMPLICIT j NT EGER ( A -Z) 

PAR AME TER NS YS = 8 ,8 I GREG* 1 000000000 

DIMENSION XWBINSYS > .XN1 < NSYS) . XN2(NSYS) .XT4(NSYS I . X NN ( NSYS) . 

XT 1 (NSYS 1 .XT2< NSYS) . XT3(NSYS) .NAME) 3.NSVS) . 

WOSS$< NS YS) . SSSfNSYS ( .SdCPLINSVS I .SdCPH(NSYS > 
real XT1.XT2.RT3.XT4.T1.T3.T3. T4.Z.0ENSTV. CFl .CF2.SUFL .SUFH. 

C PEL. CP FH. JOS CPU. JOBIOT. JOBWCT.HCPU.HI0T.MWCT.SJJO1.50I0H 
LOGICAL BYPASS. SPl LL.SS 
DATA BYPASS/ T/ SS/ F/ 

DATA CfI/i .333/ CF 2/1 . 333/ NLL/2/ NLH/3/ 

DATA ( SJCPL ( d).SdCPH< d ) . d=1 . NSYS)/ 

231.469. 57.156. 600.1250. 250.469. 97.179. 

56.104. 250-375. 167.375/ 

DATA SdIOL/1 00./ Sd IOH/ 1 25 . / 

DATA (I NAME! d.I ) ,d«1 .3) . NOSSSI I ) . SSS ( I ) .XT 1 ( I ) .XT2( I > . 

XT3 ( I ) . XT4( I f.XNIt It . XN2/ 1 > ,X*N< I) . X *B( I J . I * 1 . NS YS) / 

- 'COC CYBER 1 73 ’ . 32539 . 325 39 . 8 . 0 . 6 . 0 . 1 0 . . . 050 1 . 1 . 25000.1792. 

• ‘CDC Cyber 1 75 ' .49B92. 49892 , 0 . 6 . 1 . 5. 3 . 0. ,050. 1 . 1 . 25000.1792. 

- 'DEC VAX- 1 1/780 '.11916. 8733 . 1 3 . . 16 . . 1 7 . . . 044 .8 , 4 . 192000 . 9400 . 

- ' IBM 3031 '.28019.25715.8.0. 14 . . 1 4 . . . 041 .8 . 4 . 192000.9400. 

-'IBM 3032 ' .42258. 401 1 1 .3 .0 .6.0.6 .0. .041 ,8,4. 192000.9400. 

-‘IBM 3 c 33 ' .61 143. 569 12. 1 .8 .3.0.3 . 0, .041 .8,4. 192000.9400. 

■ 'ONI VAC 1 100/80 ' .26756.26756.5.0 .5.0. 1 0. . .038.2, 1 . 34000.1796. 

- 'UNIVAc 1100 / 80 +SAM' .28410.28410.4.0.3.0.6.0, .038,2,1 . 34000.1796/ 
NAME 1 1 ST/H05 TS/XT1 . XT2 . X T3 . X T4 . XN 1 . XN2 . XWN . *W8 . 

MOSSS. SSS. SdCPL .SdCPH. BYPASS 
NAME LIST/ JOB /SYSTE M.REGI ON. C. N .OENSTY.SS 
DEFINE GSKX )*INT< X+0. 999999991 
READ! 5 HOSTS .END*300) 

I F ( BYPASS I GO TO 200 
PRINT 1000 
CO 100 1*1. NSYS 

PRINT i 100. < NAME(d . I) ,d*l .3) . I .XTK I ).XT2( 1 ) . XT3( I ) , XT4I I ) . 

XN1 ( I ) . XN2( 1 1 . XWN( 1 ) , XMB( X I 

loo ccf:::i."dc 

200 FRINT 1200 

30C READ < 5 dOR.E ND*400 ) 

IF( SYSTEM EQ.O) GO TO 300 
IF(SS) REGI0N*8IGREG 
T 1 = XT 1 | SYST£M)*1£- 6 
T2=XT2 ( SYSTEM)* IE- 6 
T3*XT3 t SYSTEM)*1E- 6 
T4* XT4 1 SYSTEM) 

N1*XN1 (SYSTEM) 

N2*XN2 ( SYSTEM) 

HEADER *B* N2 
NN=XWN ( SYSTEM) 

NB=XWB(SYSTE») 

NSREGN.1 01* (N1*C* (C-1 )/2.-YNN-Y5*WB-t-C*(«1'»2*N2)) 

I F ( NOT.SS) REGION oMINOI REGION. NSREGN) 

NS«REGI0N-NN-5»NB 
S0TARG«2*( *$-C*(N1 +2*N2 ) )/N1 
IFISOTARG.LT .0) CO TO 500 
CS=S0RT(SQTARG) 

s-o 



57 

£8 

59 

SO 

61 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 
81 
82 


85 

86 

87 

88 

89 

90 

91 

92 

93 

94 

95 

96 

97 

98 

99 
100 
101 
102 

103 

104 

105 

106 

107 

108 

109 

110 


SOI A RG =C* I C - 1 )-CV ' 2 

1 F( SOT aRG.GE . 0 ) S*C-S 0 RT<SQTARG> 

Z=M£X 0 ( 1 .SI P REAL l* MEANS Z = S. AVOIDING OPTIMIZER PROBLEMS 
SPI LL. . FALSE 
I F l S.NE . 0 ) SPILL*. TRUE . 

N* 0 . 5 * N 1 ’C** 2 

CPU = N ' | 0 . 5 *T 1 *C'* 2 +T 2 *C) ♦ 0.5 

IFlSPl LL) CPU*CPU+N*T 3 *C** 2 /Z 

RKLL=G$I( N*( N'DENS TY*N 1 *HEA 0 ER I /float (MB I > 

WIN- G$ i(IN'(C*(N 1 +N 2 ) +HE AOE R ) ) /FLOAT (W 8 ) ) 

R IN s MI N 

WLl L = G$I I ( N* ( ( C + 1 ) -N 1 +HEADER) ) /FLOAT (MB)) 

*S 0 = 0 

I F( SPILL) WS 0 *G$II ((GSI(C/Z)* 1 ) *G$ 1 ( C/Z ) • S/ 2 . ♦ 

(GSI (N/Z)-GS 1 (C/Z)>»(C* 1 ))*C*<N 1 +N 2 >/FL 0 AT<MB>) 

RSI rWSO 

10 ? - RK LL+WIN+RIN+W LLL+WSO+fiSI 

If - ■'A . A. 0.5 

UOECPUrCFI -CPU 

IF(SS) J 06 CPU = 0 BG'JOBCPU 

JOB I OT iCF 2 * I OT 

IF(SS) JOeiOT*CF 2 - IOT ♦ ( 2 *N/ 5000 . ♦ 1 ) * 1 20 . ♦ 120 « 2 MINUTES OF I/O 
JOEWCT = JOECP U+JOB I OT 

SUF L * I WAX ( NL L'JOBC PU/ 60 . .FLOAT ( N ) /I 0000 ‘SJ IOL I ♦ 

maxi float iM / ioooo*sjcpl( system i . nll *uobiot/ 6 o. ) >/ 72 . 

C ... ABOVE S BELOW: / 72 . ..> ( / 60 / 1 20 ) • 1 00 . WHERE 60 IS USED TO 

c ... CONVERT MIN TO HR. 120 TO ADJUST TO 120 -HR SYSTEM MEEK , AND 

C ... 100 is USED TO CONVERT TO A PERCENTAGE. 

SUF H - I MAX 1 NL H * JOBC PU/ 60 . . FLOAT ( N ) / 1 0000 * SJ IOH )♦ 

MAXI FLOAT(N) / 10000 *SJCPH(SYSTEM),NLH»J 0 BI 0 T/ 60 . ) )/ 72 . 

COST^MCSSSISTSTEMi 
I F I 5 S 1 COST. SSS( SY STEM) 

CPF L* I ( SUF L/ 100 . | - COST )/ 100 . 

CPTH.i 1 SUFH/ 100 . ) * COST)/ 100 . 

HIOT.JOBIOT/ 3600 - 
HCPU=OCBCPU/ 3600 . 

HWCT.JOBWCT/ 3600 . 

«RI I t. c.iuuo ) ( NAM £(J .SYSTEM) . 0 * 1 , 3 i . COST . N .C . REGION. 

5 UFL .cpfl.sufhcpfh.hcpu.hiot.hnct 

GO TO 300 
400 STOP 

= 00 WRITEl 6.1500 ) 

GO TO 300 

1000 FORMAT I ' 1 SYSTEM CH ARACTE R 1 ST I C S : ' /// 20 X . ' I D T» T 2 '. 

T 3 T 4 N 1 N 2 WN MB '//) 

1100 FORMA* ( 1 . 1 . 3 A 6 . 13 .F 10 . 1 .F 6. 1 .F 7 . 1 . F 9 . 3 . 17 . I 7 . 19 , IB) 

1200 FORMAT ( ////T 24 . ' COST' , 5 X . 'N ' , 5 X. 'C' . 8 X. ' REGION' . 7 *, ' SUFL' . 
4 x.'CPFL' . 7 X . ' SUFH' , 4 X , 'CPFH' . 14 *. • LCPU' . 4 X. 

' L I OT ' . 4 X . ' LWCT ' // ) 

1300 FORMAT ( T 6 . ' SYSTEM REGION C N' /I X 3 A 6 . 1 8 . IS. 17 ) 

1500 FORMAT ( 'OREG ION SIZE TOO SMALL FOR EXECUTION.') 

200 FORMAT ( IX, 3 A 6 ,' S ' . 1 5 . 1 6 . 16 . 3 X . 19 . F 10 . T . ' V . F 8 . 0 . FtO . 1 , 

, F 8 . 0 . 10 X. 3 F 8 . 2 ) 

END 



L,. 


CO 

T) 

O 

O 

SO 


f 



SAMPLE INPUT 


XI 4669* 

T PF*( 1 ) 

. experiment 


1 

SKOSf 

S ByPAbS» . F . S 


3 

sjoe 

N. 1C00o,Ci400. DENS TV., 003. SYSTEM. 1 .REGION. 1 18000 

S 

3 

SJOB 

SYSTEM.2 S 


4 

$008 

REGION *900000. SYST em«3 S 


s 

SJ03 

SYSTEM. 4 $ 


6 

SJOB 

SYSTEM. 5 S 


7 

SdOB 

SYSTEM *6 $ 


9 

SdOB 

REG I ON. 21 0000 .SYST EM. 7 S 


9 

*JOB 

SYSTEM. 8 S 


10 

SdOO 

system.o.ss.t S 


t . 

Sjqb 

N> i 0000. C> 400. DENS TV*. 003. SYSTEM. 1 .REGION- 1 16000 

S 

12 

SJOB 

SYSTEM.2 S 


13 

SJOB 

REGION. 900000. SYSTEM.3 S 


1 » 

SJOB 

SYSTEM. 4 S 


15 

SJOB 

SYSTEM. 5 S 


16 

SJOB 

SYSTEM. 6 S 


17 

SJQ 8 

REGION. 210000. 1YST tM«7 t 


16 

SJOB 

SYSTEM. « S 



U1 

O 



SAMPLE OUTPUT 


SYSTEM CHARACTERISTICS: 



ID 

T 1 

T 3 

73 

T4 

Ml 

M3 

MM 

WB 

COC CYBER 1 73 

1 

o.o 

6 .0 

10.0 

.050 

1 

1 

35000 

1793 

COC CYBER 1 75 

3 

.6 

1 .5 

3.0 

.050 

1 

1 

35000 

1793 

DEC VAX- 11/700 

3 

13 .0 

16 .0 

17.0 

.044 

• 

4 

1 93000 

9400 

IBM 3031 

4 

8.0 

14 .0 

14.0 

.041 

• 1 

4 

1 93000 

9400 

IBM 3033 

5 

3.0 

6 .0 

6.0 

.041 

8 

4 

1 93000 

9400 

IBM 3033 

6 

1 .8 

3.0 

3-0 

.041 

8 

4 

1 93000 

9400 

UNIVAC 1100/80 

7 

3.0 

S .0 

10.0 

.038 

3 

1 

34000 

1798 

UNIVAC 1 100/80* SAM 

8 

4.0 

3 .0 

6.0 

.038 

3 

1 

34000 

1798 



COST 

N 

c 

CDC CYBER 1 73 

S33S39 

10000 

400 

COC CYBER 1 75 

*49893 

10000 

400 

DEC VAX- 1 1/ 780 

*11 916 

10000 

400 

IBM 3031 

*38019 

10000 

400 

1EM 3033 

SY32S9 

10000 

400 

IBM 3 033 

*61 143 

10000 

400 

UNIVAC 1100/80 

*36 756 

10000 

400 

UNIVAC 1 1 OO/BOtSAM 

JOB 4 1 0 

10000 

400 

COC CYBER 1 73 

*33639 

10000 

400 

CDC CYBER 1 75 

*49893 

10000 

400 

DEC VAX- 11/780 

* 6733 

10000 

400 

IBM 3031 

*35715 

10000 

400 

IBM 3033 

*40111 

10000 

400 

IBM 3033 

*56913 

10000 

400 

UNIVAC 1 100/80 

*36756 

10000 

4Q0 

UNIVAC 1 100/80+SAM 

*38 410 

inooo 

4Q0 


(1) ’’**********" indicates substructuring 


REGION 

sun 

CRFl 

SUFH 

1 16109 

7.3* 

33. 

13.5* 

116109 

3.3X 

11 . 

3.9* 

893638 

14.8* 

18. 

37.0* 

893638 

7.5* 

31 . 

13.5* 

893638 

3.8* 

12. 

4.7* 

893638 

2.3* 

13. 

3.3* 

306331 

6.0* 

16 . 

8.9* 

306221^ ^ 

4.3* 

13. 

8.2* 

8.4* 

31 . 

11.3* 


3.2* 

11 . 

3.9* 


13.5* 

13. 

35.1* 


6.7* 

17. 

11.3* 


2.7* 

11 . 

4.3* 


2.2* 

13. 

3.2* 


8.5* 

18. 

9.3* 


3.8* 

11. 

7.8* 


CMH 


LCRU 

HOT 

LNCT 

41 . 


3.38 

.31 

3.59 

19. 


.19 

.31 

.39 

32. 


3.87 

.23 

4.10 

35. 


2.39 

.31 

3.60 

30. 


.90 

.31 

1.11 

19. 


.84 

.31 

.75 

34. 


1.49 

.39 

1.75 

33. 


1.19 

.36 

1 .45 

37. 


1.90 

.36 

3.38 

19. 


. 14 

.38 

.53 

33. 


3.10 

.40 

3.49 

29. 


1.91 

.39 

3.39 

17. 


.73 

.39 

1.10 

18. 


.43 

.38 

.81 

33. 


1.19 

.43 

1 .63 

33. 

C VO 
2% 

£ ,*-< 

.99 

.43 

1.38 
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